Mapping Morphological and Phonetic Features of Catalan: a General Template for Contemporary Atlases and Corpus

نویسندگان

  • Maria-Pilar Perea
  • Tomás Navarro Tomás
چکیده

In Catalonia, from a general point of view and concerning Geolinguistics, three assessments can be done: a) no new initiatives for creating a general linguistic atlas are expected; on the contrary, the tendency would be to create regional or local atlases or, disregarding cartography, to develop of monographs concerning several linguistic aspects of a certain dialectal area; b) there is no perceived need for an electronic publication of the atlas or the release of an internet version (the general format used is paper); and c) there is a possibility of computerising the data contained in old atlases. The main aim of this paper is to describe the processes of systematisation and mapping of dialectal data based on “La flexió verbal en els dialectes catalans”. The paper is structured in five parts: a) The corpus of morphological and phonetic data; b) Mapping the data; c) Using the program; d) Sound maps; e) Conclusions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ClInt: a Bilingual Spanish-Catalan Spoken Corpus of Clinical Interviews

In this paper we present ClInt (Clinical Interview), a bilingual Spanish-Catalan spoken corpus that contains 15 hours of clinical interviews. It consists of audio files aligned with multiple-level transcriptions comprising orthographic, phonetic and morphological information, as well as linguistic and extralinguistic encoding. This is a previously non-existent resource for these languages and i...

متن کامل

Frequency analysis of phonetic units for concatenative synthesis in catalan

Knowledge of phonetic unit frequency is very necessary for developing databases in both concatenative synthesis and continuous speech recognition. In the present work, a large corpus of text was processed and phonetically transcribed to obtain allophone and diphone frequencies for the Catalan language. The corpus was acquired from newspaper articles, in which there were a lot of foreign words t...

متن کامل

Catalan Geolinguistics and New Technical Procedures

New technologies are helping researchers to apply new methods to the treatment of dialectal data, accomplishing a variety of research objectives in the stages of data compilation, data processing and the presentation of results. In this regard, dialectology has at least two aspects: a) obtaining new data to learn contemporary linguistic variation; and b) retrieving earlier material to facilitat...

متن کامل

A Phonetic-Based Approach to Chinese Chat Text Normalization

Chatting is a popular communication media on the Internet via ICQ, chat rooms, etc. Chat language is different from natural language due to its anomalous and dynamic natures, which renders conventional NLP tools inapplicable. The dynamic problem is enormously troublesome because it makes static chat language corpus outdated quickly in representing contemporary chat language. To address the dyna...

متن کامل

The Assessment of Pragmatic Knowledge in the Online General IELTS-Practice Resources: A Corpus Analysis of Writing Tasks

Motivated by the concept of Communicative Language Ability and the eminence of the IELTS exam, this study intended to scrutinize the representation of functional knowledge (FK) and socio-linguistic knowledge (SK) as sub-components of pragmatic knowledge in the writing performances of both tasks of the online General IELTS-practice resources across three band scores. This quantitative inter-scor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010